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Abstract. A desirable property of an autocovariance estimator is to be robust to the pres- 
ence of additive outliers. It is well-known that the sample autocovariance, being based 
on moments, does not have this property. Hence, the use of an autocovariance estimator 
which is robust to additive outliers can be very useful for time-series modeling. In this pa- 
pe r, the asymptotic properties o f the robust scale and autoc ovariance estimators proposed 
by iRousseeuw and Croux |l993h and iMa and Gentonl (|200oh are established for Gaussian 
processes, with either short-range or long-range dependence. It is shown in the short-range 
dependence setting that this robust estimator is asymptotically normal at the rate y/n, where 
n is the number of observations. An explicit expression of the asymptotic variance is also 
given and compared to the asymptotic variance of the classical autocovariance estimator. 
In the long-range dependence setting, the limiting distribution displays the same behavior 
than that of the classical autocovariance estimator, with a Gaussian limit and rate *Jn when 
the Hurst parameter H is less 3/4 and with a non-Gaussian limit (belonging to the second 
Wiener chaos) with rate depending on the Hurst parameter when H £ (3/4, 1). Some Monte- 
Carlo experiments are presented to illustrate our claims and the Nile River data is analyzed 
as an application. The theoretical results and the empirical evidence strongly suggest using 
the robust estimators as an alternative to estimate the dependence structure of Gaussian 
processes. 



1. Introduction 



The autocovariance function of a stationary process plays a key role in time series analysis. 
However, it is well known that the classical sample autocovariance function is very sensitive 
to the presence of additive outliers in the data. A small fraction of additive outliers, in some 
cases even a single outlie r, can affect the classical auto c ovaria n ce estimate making it virtually 



usele ss; see for instance iDeutsch et al.1 (|1990l ) IChanl (j 19921 ). IChanl (|1995l ) (jMaronna et al 
20061 . Chapter 8) and the references therein. Since additive outliers are quite common in 



practice, the definition of an autocovariance estimator which is robust to the presence of 
a dditive outliers is an im portant task. 

Ma and Gentonl (120001 ) proposed a robust estimator of the autocovariance function and 



dis cussed its perf o rman ce on synthetic and real data sets. This estimator has later been used 
bv lFajardo et al.1 (|2009l ) to derive robust estimators for ARMA and ARFIMA models. 
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The autocovariance estimator proposed b y iMa and Gentonl (|2000l ) is based on a method 
due to iGnanadesikan and Kettenrind (|1972l ) , which consists in estimating the covariance of 
the random variables X and X' by comparing the scale of two appropriately chosen linear 
combinations of these variables; more precisely, if a and b are non-zero, then 



Cov(X, X') 



— \Xai(aX + bX') 
Aab 1 



Var(aX - bX')} 



(1) 



Assume that S is a robust scale functional; we write for short S(X) = S(Fx), where Fx is 
the c.d.f o f X and assum e that S is affine equivariant in the sense that S(aX + b) = \a\S(X). 
Following iHuberl (|l98lh . if we replace in the above expression Var(-) by S 2 (-), then (pTJ) is 
turned into the definition of a robust alternative to the covariance 



C S (X,X') 



— {S 2 {aX + bX') 
4ab 1 



S 2 (aX-bX')} 



(2) 



The constants a and b can be chosen arbitrarily. If X and X ' have the same scale (e.g. the 



same marginal distribution), one could simply take a = b = 1. IGnanadesikan and Kettenring 
(|1972l ) suggest to take a and b proportional to t he inverse of S (X) and S(X'), respectively in 
order to standardize X and X'. As explained in IHuberl (|198lh . if S is standardized such that 
S(X) = 1 in the case where X is standard Gaussian, then, provided that (X, X') is bivariate 
normal, 

C s (X,X') = Cov(X,X') . (3) 

In this case indeed, aX + bX' and aX — bX' are Gaussian random variables with variance 
a\ = a 2 Var(X) ± 2a6Cov(X, X') + b 2 Var(X'), and so, if Y ~ Af(0, 1), then S{aX ± bX') = 
S(a±Y) = a±S(Y) = a± and S 2 {aX + bX') - S 2 (aX - bX') = a\ - <r 2 _ = 4a6Cov(A, X') 





Ma and Genton 


(2000) sue 


Rousseeuw and Croux ( 


199S|) 



correlation integral, defined as 

r i— ► U(r, Fx ) 



J j ^{\x-x'\<r}dFx(x)dF x (x') , 



(4) 



which measures the probability that two independent copies X and X' distributed accord- 
i ng to Fx fall at a distance smaller than r. The robust scale estimator introduced in 
(jRousseeuw and Crouxl . Il993l . p. 1277) defines the scale Q(Fx) of a c.d.f. Fx as being 
proportional to the first quartile of r i— > U (r, Fx), namely, 



Q(F X ) = c(F x )mi{r > 0,U(r,F x ) > 1/4} 



(5) 



where c(Fx) is a constant depending only on the shape of the c.d.f. Fx- We see immediately 
that Q{Fx) is affine invariant, in the sense that transforming X into aX + b, will multiply 
Q(Fx) by \a\ . This scale can be seen as an analog of the Gini average difference estimator 
n~ l (n — l)^ 1 X^i<j^j<n \Xi — Xj\, where the average is replaced by a quantile. It is worth 
noting that instead of measuring how far away the observations are from a central value, 
Q(Fx) computes a typical distance between two independent copies of the random variable X, 
which leads to a reasonable estimation of the scale even when the c.d.f. Fx is not symmetric. 
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The constant c(Fx) in ([5j) is there to ensure consistency. In the sequel, the c.d.f. Fx is 
assumed to belong to the Gaussian location-scale family 

{$^(0 = $((• - n)/a), fi E R, a E IK;} , (6) 

where $ is the c.d.f. of a standard Gaussian random variable. The reason we focus on the 
Gaussian family is that if we want to use Q as the scale S in ([2]), we will need to compute 
c(F a x+bX') and c(F a x-bX')- This is easily done when (X, X') is a Gaussian vector. Indeed, 
in view of ([3]), one has 

Cav(X,X') = - A lQ\F x+ x>) ~ Q 2 {F X _ X ,)] , (7) 

and in particular, since by @ and ©, Q 2 {F 2 x) = (2Q(Fx)) 2 , 

Var(X) = Q\F X ) . (8) 
When F x = rr we can t hen obtain the constant c(3>^ )0 -) in ([5]) explicitly as noted by 



Rousseeuw and Croux ( 19931 ). Since Q( < I ) Ati(7 ) = cr, © becomes 

f = Q(*/t,<r) = c($ m , ct )ot (9) 

where r is such that, in (gj), C/(r , $) = 1/4. Hence for all (fi,a) eRx R^, 

c($ M , CT ) = c($) = l/r = l/(V2$- 1 (5/8)) = 2.21914 . (10) 

Let (Xi)i>i be a stationary Gaussian process. Given the observations X\ :n = (X\, . . . , X n ), 
the c.d.f. of the observations may be estimated using the empirical c.d.f. r ^> F n (r) = 
n~ l J2i=i ^{x,<r}- Plugging F n into ([5]) leads to the following robust scale estimator 

Qn (X 1:n , = c(&){\Xi - Xj\; l<i,j<n} {kn) , (11) 

where k n = [n 2 /4:\. That is, up to the multiplicative constant c(&), Q n {X\- n , $) is the A; n th 
order statistics of th e n 2 distances \Xj — Xj \ betw een all the pairs of observations. 

As mentioned by Rousseeuw and Croux ( 19931 ). Q n (Xi :n ,$) has several appealing prop- 
erties: it has a simple and explicit formula with an intuitive meaning; it has the highest 
possible breakdown point (50%); in addition, the associated influence function (see below) 
is bounded. For a d efinit ion of these quantities, which are classical in robust statistics, see 
for instance iHuberl (|198ll ) . The scale estimator of Rousseeuw and Croux is also attractive 
because it can be implemented very efficiently; it can be comp uted with a time-complexity o f 
order 0(n log n) and with a storage scaling linearly 0(n); see ICroux and Rousseeuw! (jl992l ) 
for implementation details. 

Using the robust scale estimator Q n (•, $) in (|11|) and the identity (J2J) with a = b = 1, the 
robust autocovariance estimator of 

7(h) = CovpTi, X h+ i) = ^{Var(X! + X h+1 ) - \ & x{X l - X h+1 )} 

is 

%(h,X lm ,$) = j{Ql_ h (X 1:n - h + X h+1:n ,$)-Ql_ h (X 1:n - h -X h+1:n ,$)} . (12) 

Thus, in the sample version (|12p . the random variable X\ ± X^+i is replaced by the vector 
Xx-.n-h ± Xh+im of length n-h. 
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In this paper, we establish the asymptotic properties of Q n {X\ :n , <£>) and the corresponding 
robust autocovariance estimator jQ(h,Xi. n ,&) for Gaussian processes displaying both short- 
range and long-range dependence. We say that the process is short-range dependent if the 
autocovariance function {7(&)}fcez is absolutely summable, ^fcez llWI < oo. We say that 
it is long-range dependent if the autocovariance function is regularly varying at infinity with 
exponent D, j(k) = k~ D L{k) with < D < 1 and L is a slowly varying function, i.e. 
linifc^oo L(ak)/L(k) for any a > 0, and is positive for large enough k. The exponent D is 
related to the so-called Hurst coefficient by the relation H = 1 — D/2. See, for more details, 
(|Doukhan et all hooi . p. 5-38). 

The limiti ng distribution s of th ese estimators are obtained by using the functional delta 
method; see van der Vaartl ( 19981 ). In the short memory case, the results stems directly 
from the weak invariance principle satisfied by the empirical process F n under mild technical 
assumptions. The rate of convergence of the robust covariance estimator is y/n and the 
limiting distribution is Gaussian; an explicit expression of the asymptotic variance is given in 
Theorem 21 

In the long memory case, the situation is more involved. When D > 1/2 (or H < 3/4), 
the rate of convergence is still y/n, the limiting distribution is Gaussian and the asymptotic 
variance of the covariance estimator is the same as in the short-memory case. When < D < 
1/2, the rate of convergence becomes equal to n D /L{n) where L is a slowly varying function 
defined in (|38|) ; the limiting distribution is non-Gaussian and belongs to the second Wiener 
Chaos; see Theorem [8) We prove that these rates are identical to the ones of the classical 
autocovariance estimators. 

The study of the asymptotic distribution of the empirical process is not enough to derive 
these results. It is necessary to use results on the empirical version of the correlation integral 
which require s extensions of the result s derived for [/-processes under short-range dependence 
conditions bv lBorovkova et al.l (|200ll ). For this part, we use novel results on [/-processes o f 
long- memory time-series that are developed in a companion paper iLevv-Leduc et al.l (| 20091 ). 

The outline of the paper is as follows. In Section [51 the limiting distributions of the robust 
scale estimator Q n {X\ :n , <J>) in the Gaussian short-range and long-range dependence settings 
are proved. From these results, the asymptotic distribution of 7q(/i, X± :n , <&) is derived. In 
Section [31 some Monte-Carlo experiments are presented in order to support our theoretical 
claims. The Nile River data is studied as an application in Section [H Section [5] is dedicated 
to the asymptotic properties of [/-processes which are useful to establish the results of Section 
[2] in the long-range case. Sections [6] and [7] detail the proofs of the theoretical results stated 
in Section [2j Some concluding remarks are provided in Section [8l 



Notation. For an interval / in the extended real line [—00,00], we denote by D(J) the set 
of all functions z : / — > R that are right-continuous and whose limits from the left exist 
everywhere on I. We always equip D(I) with the uniform norm, denoted by || ■ H-q- We denote 
by A^([— 00, 00]) the set of cumulative distribution functions on [—00, 00] equipped with the 
topology of uniform convergence. For U £ D(-[), let U~ 1 denote its generalized inverse, 
U~ l (ri) = inf{r £ /, U(r) > r]}. 
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The convergence in distribution in (D([0, oo]), (Hl^) is meant with respect to the cr-algebra 

generated by the set of open balls. We denote by — — > the convergence in distribution. 

We denote by $ the c.d.f of the standard Gaussian random variable and by <f> the corre- 
sponding density function. 

2. Theoretical results 

Define the following mappings: 

Ti : M([-oo,oo]) ->D([0,oo]) 

F I f l {lx _ yl < r} dF(x)dF(y)\ , (13) 

T 2 : D([0,oo]) — > R 

U i — ► C/ _1 (l/4) . (14) 

and 

T = T 2 oT 1 : M([-oo,oo])) — > R (15) 
F i — ► C/" 1 ! 1 / 4 ) • (16) 
Then, the scale estimator Q n (Xi :n , $) introduced in (fTTj) may be expressed as 

Q n (X 1:n ,$) = c($)r (F n ), (17) 
where F n is the empirical c.d.f. based on X\. n . 

2.1. Short-range dependence setting. 

2.1.1. Properties of the scale estimator. The following lemma gives an asymptotic expansion 
for Q n (Xi :n , $), which is used for deriving a Central Limit Theorem (Theorem[2]). It supposes 
that the empirical c.d.f. F n , adequately normalized, converges. 

Lemma 1. Let (Xj)j>i be a stationary Gaussian process. Assume that there exists a non- 
decreasing sequence (a n ) such that a n (F n — ^^ a ) converges weakly in (D([0, oo]), || • Hoo) ■ Then, 
Q n (Xi ln , 3>) defined by (jlip has the following asymptotic expansion: 

n 

a n (Q n (X 1:n ,<f>)-a) = ^VlF(X i ,Q,$ Ati(T ) + 0p (l) , (18) 

n 

i=l 

where, for all x in R, 

lF(x, Q, = aIF((x - (i)/a, Q, $) , (19) 

cm<i 

IF(x Q $) = c(*) f l/4-^ + l/cW) + ^-l/c W ) \ 
The proof of Lemma [1] is given in Section [6] 



6 



C. LEVY-LEDUC, H. BOISTARD, E. MOULINES, M. S. TAQQU, AND V. A. REISEN 



Remark 1. Note that IF(x,Q,&) has the same expression as the influence f unction of the 
functional Q eva l uated at the c.d.f. $ given by (jRousseeuw and Crouxl . Il993l . p. 1277) and 
(|Ma and Gentonl . |200CL p. 675). As is well-known from (|Hubej Il98ll . p. 13), the influence 
function x i— ► TF(x,T,F) is defined for a functional T at a distribution i* 1 at point x as the 
limit 

IF(x, T, F) = lim e~ l {T{F + e(6 x - F)) - T(F)} , 

£-+0 + 

where S x is the Dirac distribution at x. Influence functions are a classical tool in robust sta- 
tistics used to understand the effect of a small contamination at the point x on the estimator. 

We focus here on the case where the process (Aj)j>i satisfies the following assumption: 

(Al) (Xj)j>i is a stationary mean-zero Gaussian process with autocovariance sequence 
j(k) = E(X 1 X k+1 ) satisfying: 



Ei#)i 



< oo 



k>l 



To state the results, we must first define the Hermite rank of the influence function x i— > 
IF(x,<5,$). Let {-fffc} denote the Hermite polynomials having leading coefficient equal to 
one. These are Hq(x) = 1, H\{x) = x, H2(x) = x 2 — 1, • • • . Let / be a function such that 
f f 2 (z)d&(z) < oo. The expansion of / in Hermite polynomials is given by 



/(*) 



(21) 



?=r(/) 



where ct q (f) = J f(z)H q (z)d&(z) and where the convergence is in L 2 (M,$). The index of 
the first non zero coefficient in the exp ansion, denoted t(/), is called the Hermite rank of the 
function /. ( Breuer and Maiorl . Il983l . Theorem 1) shows that if 



E WW 



r(f) 



< OO , 



(22) 



h=— oo 



then the variance Var (n Y^7=i /(-^O) converges as n goes to infinity to a limiting value 
<7 2 (/) which is given by 



a 2 (/) = Var [/(A x )] + 2 E Gov [/(A h+1 ), /(Ax)] 



h=l 



E 



'(0) + 2^7 9 (M 



(23) 



In addition, the renormalized partial sum is asymptotically Gaussian, 



n 



-1/2 



E/(^)^AA(0,a 2 (/)) 



(24) 
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Concerning the empirical process, ICsorgo and Mielniczukl (jl996l ) proved that if 

oo 

£ 1-rWI < ~ , (25) 

h=—oo 

then - v /n(F n (-) — <1>o,(t( - )) converges in D([— oo, oo]) to a mean- zero Gaussian process W(-) with 
covariance 

E (W(r)W(r')) = £ L( ) + 2 f>^)| , 

where J g (r) = J[l{ - a: <r} ~~ ^o^M^ijO^d^KzO for all r in [—00,00]. These results are used 
to prove the following theorem in Section [6l 

Theorem 2. Under Assumption (AH^, Q n (Xi :n ,$) defined by Ul\) . satisfies the following 
central limit theorem: 

v^(Q„ (X lM , *) - a) AA(0, ? 2 ) , 



where a = y7(0) and the limiting variance a 2 is given by 

a 2 = 7 (0)E[IF 2 (X 1 / f T, Q, $)] + 2 7 (0) £ E[IF(Xi/a, Q, $)W(X k+1 /a, Q, *>)] , (26) 

fc>l 

IF(-,Q,$) 6em# defined in (|20|) . 

It is interesting to compare, under Assumption (AH]), the asymptotic distribution of the 
proposed estimator Q n {X\ :n , <3?) with that of the square root of the sample variance 

1 " 1 

fe=l v 7 l<i^J<" 

where X n = n" 1 ^"=1 ^i- 

Proposition 3. Under Assumption (AH^, 

Vn (a n>x - a) — > A/"(0, a 2 ; ) , 

u;/iere 

^ = (27(0))" 1 (7(0) 2 + 2^ 7 (fc) 2 ). (28) 

fe>i 

T/ie relative asymptotic efficiency S^/ir 2 of the estimator Q n (Xi ;n ,<3>) compared to a n .x is 
larger than 82.27%. 

The index "cl" stands for "classical" . The proof of Proposition [3] is given in Section [6l 
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2.1.2. Properties of the autocovariance estimator. In this section, we establish the limiting 
behavior of the autocovariance estimator given, for < h < n, by 

%{h, X lM , *) = - [Cg_ h (X lin - h + X h+l:n , $) - Q 2 n _ h (X lm - h - X h+1:n , $)] . (29) 

Theorem 4. Assume that (j4QP holds and let h be a non negative integer. Then, the autoco- 
variance estimator 7q(/i, Xi ;n ,$) satisfies the following Central Limit Theorem: 



^(7q(h,X 1:oi $)-7(ft))AAf(0^) , 
where 

a\h) =E[ t p 2 (X 1 ,X 1+h )} +2j2n^ 1 ,X 1+h )^(X k+1 ,X k+1+h )] , (30) 

it>i 

and the function tp is defined by 
tp : (x,y) i ^ 



(31) 

where IF is defined in (|20p . 

The proof of Theorem 2] is given in Section [6) 

Remark 2. Note that has the same expression as the influence function of 7q(-) given in 
(|Ma and Gentonl . I2OO0L p. 675). 

Remark 3. Let us now compare under Assumption (AH]) the asymptotic distribution of the 
proposed estimator with the classical autocovariance estimator defined by 

n—h 

y(h) = n" 1 Y,( x i ~ X n ){X l+h - X n ), < h < n - 1 . (32) 

i =l 

Under (AQ]), applying ( Arconesl . 1 1994 . Theorem 4) to / : (x,y) ^ xy and Xj = (Xj, Xj + h), 
where h is a non negative integer, leads to the following result. 

Proposition 5. For a given non negative integer h, as n — > 00, 

^i(j(h)- 7 (h))^M(0,a 2 d (h)), 

where 

*ci(h) = 7 2 (0) + 7(h) 2 + 2 £ 7 2 (A0 + 2 £ 7 (A + h) 7 (k - h) . (33) 

k>l k>\ 

Let us now compare a (h) in ([30D with o^(fe) in flJSJ). Since the theoretical lower bound 
for the asymptotic relative efficiency (ARE) defined by ARE(/i) = <t^(/i)/<j 2 (/i) is difficult to 
obtain, the estimation of ARE was calculated in the case where (^Q)i>i is an AR(1) process: 
Xi = <^iXj_x + Ei, where (£i)i>\ is a Gaussian white noise, for fa = 0.1, 0.5 and 0.9. These 
results are given in Figure which displays ARE for h = 1,. . . ,60. From this figure, we can 
see that ARE ranges from 0.82 to 0.90 which indicates empirically that the robust procedure 
has almost no loss of efficiency. 
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FIGURE 1. ARE for an AR(1) process for different values of <j>v- 0.1, 0.5 and 0.9 from left to right. 



2.2. Long-range dependence setting. In this section, we study the behavior of the ro- 
bust scale and autocovariance estimators Q n (Xi :n , $) and 7q(/i, X± :n , $) in (fTT|) and ([29]) 
respectively, in the case where the process is long-range dependent. Long-range dependent 
processes play a key role in many domains, and it is therefore worthwhile to understand the 
behavior of such estimators in this context. 

(A2) (Xj)j>i is a stationary mean-zero Gaussian process with autocovariance 7(fc) = ^{X\Xk + i) 
satisfying: 

7 (jfe) = k~ D L(k), < D < 1 , 
where L is slowly varying at infinity and is positive for large k. 

A classical model for long memory process is the so-called ARFIMA(p, d, q), which is a natural 
generalization of standard ARIMA(j>, d, q) models. By allowing d to assume any value in 
(-1/2,1/2), a fractional ARFIMA model is defined by - B) d Xi = Q(B)Zi. Here 

(Zj)j g z is a white Gaussian noise, B denotes the backshift operator, <&(B) defines the AR- 
part, &(B) defines the MA part of the process, and (l-B) d = Y!k=o (k)(~ B ) k is the fractional 
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difference operator. For d ^ 0, one has 



D = 1 - 2d 



(34) 



(see (6.6) of iTaqgul (|1975l )). For d = 0, we obtain the usual ARMA model. Long memory 
occurs for d > 0. As k — ► oo, the autocovariance of an AR FIMA(p, d, q) decrease s as j(k) = 
Ck 2d ~ x . Such processes satisfy (42]) with D = 1 - 2d, see (|Doukhan et all I2003I . Chapter 1) 
for example for more details. 

Perhaps surprisingly, the proof of the asymptotic properties of Q n (Xi :n , $) in the long- 
range dependence framework does not follow the same steps as in the short-range dependence 
case. 



To understand why, assume that Assumption (A[2J) holds with 7(0) = 1. (jDehling and Taqqu . 



19891 . Theorem 1.1) shows that the difference between the empirical distribution function F n 
and <&, the c.d.f. of the standard Gaussian distribution M(0, 1) renormalized by nd^ 1 , i-e. 
nd~ 1 (F n — <E>), converges in distribution to a Gaussian process in the space of cadlag functions 
equipped with the topology of uniform convergence. The sequence d n depends on the expo- 
nent D governing the decay of the autocorrelation function to zero and also on the slowly 
varying function L appearing in (AE|: more precisely, 



d n = a(D) 



l ' 2 n l - 



D/2 



L l ' 2 {n) 



(35) 



with a(D) = 2(1 - D)~ l {2 - D)~ l for D in (0, 1) defined in (J^. Therefore, Lemma Q] shows 
that the asymptotic expansion of a n (Q n (Xi :n , <£) — 1) in (fT8|) remains valid with a n = nd~ l , 
and that it remains to study the convergence of d~ l YH=i IFpQ, Q, $). This type of non- 
l inear func tional of stationary long- memory Gaussian sequences have been studied in I Taqqu 
(|l975h and lBreuer and Maiorl (jl98^ . The limiting behavior of these funct ionals depend both 
on D and o n the Hermit e rank of the function IF(-, Q, $). According to iBreuer and Maior 
(j 19831 ) and ITaqgul (|1975l ). under Assumption (42]), two marke dly different behav i or ma y 
occur, depending on the value of D. If D G (1/2,1), then, bv IBreuer and Maiorl jl983h . 
n" 1 / 2 Ya=i IF(Xj, Q, $) converges to a zero-mean Gaussian random variable with finite vari- 
ance. If D G (0, 1/2), then n D ~ l L~ l (n ) y~!"- i IF(Xj, Q, <I>) converges to a non degenerate (non 



Gaussian) random variable, see Tag qui ( 1975 ). From these two results and (|35|) . it follows that 



d~ 1 ^2lF(X i ,Q,^) = o P (l) , 



i=l 

for D ^ 1/2. Therefore, the leading term in the expansion of Q n (X\ :n , $) — 1 in the short- 
memory setting is no longer the leading term in the long-memory case. 

This explains why the proof, in the long-memory case, does not follow the same line of 
reasoning as that in the short-range dependence case. To derive the asymptotic properties of 
Qn {Xi-.n, <&) and 7q(-, X\- n , $) for long-memory processes, it will be necessary to carry out a 
careful study of the [/-process 



U n {r) 



1 



n(n 



1) £ 



{\Xi-Xj\Kr} 



n 



(36) 
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based on the class of kernels {l{|x-y|<r}> x > U £ K,r > 0}. Its asymptotic properties can be 
derived from Prop ositio ns [TOl and [11] in Section [5] which are proved in the companion paper 



Levv-Leduc et al. 



(120091 ). 



2.2.1. Properties of the scale estimator. The next theorem gives the asymptotic behavior of 
the robust scale estimator Q n (X\ :n , $) under Assumption (AEJ. 

Theorem 6. Under Assumption Q n (Xi :n ,$) satisfies the following limit theorems as 

n tends to infinity: 

(i) If D> 1/2, 

v^(Q„ (X 1:re , $) - a) -±> M(0, a 2 ) , 

where a = yO^O), 

a 2 = 7 (0)E[IF(X 1 /a, Q, $) 2 ] + 2 7 (0) £ EflFpd/a, Q, $)IF(X k+1 /a, Q, <&)] , 



fc>i 

and IF(-, Q, <fr) zs defined in JSi 

(«; if d< 1/2, 

n D 

13(D) — (Q n (X 1:n , *) - a) ^(Z 2 , D (1) - Z? )fl (l)) , 

where /3(D) = B((l — D)/2,D), B denoting the Beta function and the processes Z\ t r)(-) 
and Z2,d{~) ar & defined in ( |5^[ ) and < I55|) . 

Theorem [S] is proved in Section O 

Remark 4. Note that in the case (ii ) the limit distribution i s not centered and is asymmet- 
ric. Moreover, it can be proved (see iLevv-Leduc et al.1 (|2009l )) that E[Z2 5 _d(1) — Zi^l) 2 ] = 
-2(3(D)/(-D + 1)(-D + 2). 

Remark 5. Under Assumption (A[2]), it is interesting to compare the asymptotic distribution 
of the proposed estimator Q n (X\._ n , $) with that of the square root of the sample variance 
a 2 n X defined in ([27]) . 

Proposition 7. Suppose Assumption (J^. Then as n —> oo, 
/aj if D> 1/2, 

Vn{°n,x - cr) —>Af(0,d%) , 

where a 2 t is given in [2B\) 
(b) if D< 1/2, 

^(£»)n I? L(n)- 1 (a ni x - <x) a/2 (Z 2>J? (1) - Z 2 D (1)) . (37) 

The rates of convergence of the square root of the sample variance a n ^x a nd of the robust 
estimator Q n (Xi :n ,$) are identical. Moreover, there is no loss of efficiency when D < 1/2. 

The proof of Proposition [7] is in Section [6j 
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2.2.2. Properties of the autocovariance estimator. In this section, we study the asymptotic 
properties of 7q(-, X\ :n , <J>) based on the asymptotic properties of Q n {X\ :n , $). 

Theorem 8. Assume that (AW\j holds and that L has three continuous derivatives. Assume 
also that Li(x) = x l L tyt \x) satisfy: Li{x)/x e = 0(1), for some e in (0,D), as x tends to 
infinity, for all i = 0,1,2,3, where £,W denotes the ith derivative of L. Let h be a non 
negative integer. Then, 7q(/i, Xi m , $) satisfies the following limit theorems as n tends to 
infinity. 

(i) If D> 1/2, 

(l Q (h, X lm , $) - 7 (h)) -±> AA(0, a\h)) , 

where 

a\h)=nxl,(X l ,X l+h ) 2 ] + 2Y,n^(Xi,X l+h )^(X k+1 ,X k+l+h )) , 

k>l 

ip being defined in \31\) . 

(ii) If D< 1/2, 

(i{D)^j- (%(h,X 1:n ,^)-j(h)) 7(Q) +l{H \ z 2 , D (l) - Z liD (l) 2 ) 
L[n) z 

where (3{D) = B((l — D)/2,D), B denotes the Beta function, the processes Zi^(-) and 
Z2,d(-) are defined in |5^| ) and [55\). and 

L{n) = 2L(n) + L(n + h)(l + h/n)" + L(n - h)(l - h/n)~ D . (38) 

Theorem [8] is proved in Section [6l 

Remark 6. Note that the assumptions on Li made in Theorem [5] are obviously satisfied if L 
is the logarithmic function or a power of it. 

Proposition 9. Under Assumption (A\^) with D < 1/2, the robust autocovariance estimator 
7q(/j, Xi- n , <1>) has the same asymptotic behavior as the classical autocovariance estimator 
(EI|). There is no loss of efficiency. 

Proposition [9] is proved in Section [6j 

3. Numerical experiments 

In this section, we investigate the robustness properties of the estimator jQ(h, X\ :n , <£) in 
(|12p . using Monte Carlo experiments. 

We shall regard the observations Xt, t = 1, . . . stationary series Y t , t = 1, . . . , re, 

corrupted by additive outliers of magnitude lo. Thus we set 

X t = Y t + uWt, (39) 

where Wt are i.i.d. random variables such that P (W = — 1) = P (W = 1) = p/2 and P (W = 0) = 
1 — p, where E[W] = and E[VF 2 ] = Vax(W) = p. Observe that W is the product of 
Bernoulli(p) and Rademacher independent random variables; the latter equals 1 or —1, both 
with probability 1/2. (Y t ) is a stationary time series and it is assumed that Y t and Wt are 
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independent random variables. The empirical study is based on 5000 independent replications 
with n = 100, 500, p = 5%, 10% and uj = 10. Other cases were also simulated, for example, 
series with u = 3, 5 which are magnitudes that cause less impact on the estimates compared 
with uj = 10. These additional results are available upon request. 

We consider first the case where Yt follows a Gaussian AR(1) process, that is, Yt = 
Ylj>o 4*\Zt-j with (pi = 0.2, 0.5 and {Z t } i.i.d N(0, 1). Then we suppose that, Y t are Gaussian 
ARFIMA(0, d, 0) processes, that is, 

^^-^£,^±1^ ,40, 

with d= 0.2, 0.45 and {Z t } i.i.d Af(0, 1). 

Classically, scale is measured by the standard deviation a. The robust measure of scale we 
consider here is Q(Fx), defined in ([5]). Recall that one has a = Q(Fx) in the Gaussian case 
(see [7J. We want to compare their respective estimators a n ^x defined in (|27|) and Q n {X\. n , <I>) 
defined in ([17]) . 

The standard deviations of the AR(1) models are Q(Fy) = ay = 1.0206 and Q(Fy) = 
ay = 1.1547 for (f>\= 0.2 and </>i = 0.5, respectively. In the case of ARFIMA processes, the 
standard deviations are Q(Fy) = ay = 1.0481 when d = 0.2 and Q(Fy) = ay = 1.9085 when 
d = 0.45. This is because t he variance of AR(1) is (1 - 4>\ )~ 1 and that of ARFIMA(0,d,0) 



is T(l - 2d)/Y 2 {\ - d) (see iBrockwell and David (|l991h ). Figure [5] and Table □ involve AR 



processes, and Figures El H] and [5] involve the ARFIMA processes. 

3.1. Short-range dependence case. Figure [2] gives some insights on Theorem[2]and Propo- 
sition[3l In the left part of Figure[2l the empirical distribution of the quantities \/n(Q n (X\ :n , <£>) 
ay) and yfn(a n — ay) are displayed. Both present shapes close to the Gaussian density, and 
their standard deviations are equal to 0.8232 and 0.7377, respectively. These empirical stan- 
dard deviations are close to 0.8233 and 0.7500 which are the values of the asymptotic standard 
deviation a in ([26]) and that of \/n(a n — ay) in ([28]) . respectively. The value 0.8233 was ob- 
tained through numerical simulations and the value 0.7500 from the fact that for an AR(1) 
7 (/c) = <^(1 - (/>2)-i and hence a 2 d = (1 + 2<^)/(2(l - </>?)) in fl28|). Hence the empirical 
evidence fits with the theoretical results of Theorem [2] and Proposition [3j 

In the right part of Figure [21 we display the results when outliers are present. The empirical 
distribution of y / n(3 ; n — ay) is clearly located far away from zero. One can also observe the 
increase in the variance. The quantity \/n(Q n {X\- n , — ay) looks symmetric and is located 
close to zero. 

We now turn to the estimation of the autocovariances. We want to use them to get estimates 
for the AR(1) coefficient <j)\. The results are in Table[TJ In this table, 7 and <j>i Q denote the 
average of the Yule- Walker estimates of the AR coefficients based on the classical estimator of 
the covariance 7 and the robust autocovariance estimator 7q(/i, X\ :n , <3>) in (j!2p . respectively. 
The numbers in parentheses are the corresponding square root of the sample mean squared 
errors. The classical estimates were obtained using the subroutine DARMME in FORTRAN 
which uses a method of moments. The robust autocovariance and autocorrelation estimates 



were calculated using the code given in ICroux and Rousseeuwl (jl992l ) . 
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0.8- 0.35- 



0.30- 




-3 -2 -1 1 2 3 10 20 30 40 50 60 70 

FIGURE 2. Empirical densities of the quantities y/n(Q n (Xi :n ,$) — ay) (plain line) and 
s/n(a n — ay) (dotted line) of the AR(1) model with <j>i = 0.2, n=500, without outliers (left) 
and with outliers with p = 10% and to = 10 (right). 

TABLE 1. Results for the estimation of AR(1) model with lo =10 







P 


= 


P = 


5% 


P = 


10% 


01 


n 


01,7 


01, Q 


01,7 


01, Q 


01,7 


01, Q 


0.2 


100 


0.1818 


0.1831 


0.0312 


0.2212 


0.01530 


0.2651 






(0.0112) 


(0.0128) 


(0.0376) 


(0.0229) 


(0.0435) 


(0.0388) 




500 


0.1967 


0.1948 


0.0318 


0.2381 


0.0163 


0.2881 






(0.0019) 


(0.0025) 


(0.0303) 


(0.0051) 


(0.0357) 


(0.0150) 


0.5 


100 


0.4767 


0.4747 


0.0998 


0.5762 


0.0495 


0.6924 






(0.0084) 


(0.0106) 


(0.1740) 


(0.0262) 


(0.2142) 


(0.0712) 




500 


0.4967 


0.4927 


0.1030 


0.6012 


0.05647 


0.7216 






(0.0015) 


(0.0021) 


(0.1598) 


(0.0141) 


(0.1988) 


(0.0558) 



It can be seen from Table [T] that both autocovariances yield similar estimates for (pi when 
the process does not contain outliers. However, the picture changes significantly when the 
series is contaminated by atypical observations. As expected, the estimates from the classical 
au tocovariance estimat or are extremely sensitive to the presence of additive outliers. As noted 
Faiardo et all (|2009h . for a fixed lag k, the classical autocorrelation tends to zero as the 



m 



weight u — > oo, and this produces a loss of memory property (that is, the dependence structure 
of the model is reduced), and consequently this leads to parameter estimates with significant 
negative bias. It is worth noting that the estimator based on the robust autocovariance (|29[) 
yields much more accurate estimates when the data contain outliers. 

3.2. Long-range dependence case. In the case of the long-memory process ARFIMA(0, d, 0) 
defined in (|4Up . we choose d = 0.2 and d = 0.45, corresponding respectively to D = 0.6 and 
D = 0.1 (see !34p . In the first case D > 1/2, in the second, D < 1/2, corresponding to the two 
cases of Theorem[6l For d = 0.2, the empirical density functions of i/^(Qn {Xi-. n , — cry) and 
\pn(d n — <jy) are displayed in Figure [3] with and without outliers. When there is no outlier, 
both shapes are similar to that of the Gaussian density, and their standard deviations are 
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equal to 0.9043 and 0.8361, respectively, corresponding to an asymptotic relative efficiency of 
85.48%. As shown in the right part of Figure [3j the classical scale estimator a n is much more 
sensitive to outliers than the robust one Q n . The empirical density in the case of outliers is 
centered around 50. 

0.6 -I 0.4 ] 



0.5- 




-4 -2 2 4 10 20 30 40 50 60 70 



FIGURE 3. Empirical densities of y / n(Q n (Xi :n ,$) — ay) (plain line) and y/n(a n — ay) 
(dotted line) for the ARFIMA(0, d, 0) model with d = 0.2, n=500 without outliers (left) and 
with outliers with p — 10% and u — 10 (right). 

To illustrate part (ii) of Theorem [61 we consider the empirical density functions of the 
quantities n 1 ~ 2d (Q n (Xi :n , $) — ay) and n 1_2d (<7 n — ay) when d = 0.45 (D = 0.6) as displayed 
in Figure HI The left part of Figure 2] shows densities having means close to -1.1161 which 
is the value of the theoretical mean given in Remark 01 Both curves present, in fact, similar 
empirical standard deviation which is in accordance with Proposition!?! The impact of outliers 
on the estimates is clearly shown in the right side of Figure H] where one observes patterns 
similar to those of the previous examples. 




-1.5 -1.0 -0.5 0.0 0.5 -10 1 2 3 4 



FIGURE 4. Empirical densities of the quantities n 1 2d (Q„ (Xi ]n , 4>) — ay) (plain line) and 
n 1 ~ 2d (a n - a Y ) (dotted line) of the ARFIMA(0, d, 0) model with d = 0.45, n=500, without 
outliers (left) and with outliers p = 10% and u = 10 (right). 

Finally, the plots of the autocorrelations are displayed in the left and right parts of Figure 
[5] for models without and with outliers, respectively. The figures also provide the population 
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au tocorrela t ion fu nction p(h) = T(l — d)T(h + d)/(T(d)T(l + h — d)) as a function of the lag 
h (jlloskingl (|l98lh ). 





FIGURE 5. Sample correlations of the ARFIMA(0, d, 0) model with d = 0.2, n=500 without 
outliers (left) and with outliers with p = 10% and u) — 10 (right), (a) is the population 
correlation, and (b) and (c) are the robust and the classical sample autocorrelations, respec- 
tively. 

In the absence of atypical observations (left part of Figure EJ, both sample functions display 
a similar behavior. However, when the data contains outliers (right part of Figure [5]) the 
classical sample autocorrelation is clearly distorted. 

3.3. Non-Gaussian observations. We now examine the behavior of the autocovariance es- 
timator when it is applied to non Gaussian observations. To do so, we generated observations 
(X t )i<t< n as follows, 

X t = faXt-x + Z t , 

where 4>\ = 0.9, e = 0.4, Zt = Wt + sY^ ! , where Wt and Yt are independent random variables 
such that (Wt) and (Yt) are i.i.d standard Gaussian random variables. An example of a 
realization of (Zt)i<t< n is given in the histogram of Figure with n = 500. As we can see from 
this figure, the presence of e in the definition of Zt produces an asymmetry in the data. In the 
right part of this figure, we displayed the average of the robust autocovariance 7q(/i, X± :n , $) 
in (|29p and the classical autocovariance 7(h) defined in Remark [3l for h = 1, ... ,40 and 1000 
replications. From this figure, we can see that the robust autocovariance estimator does not 
seem to be affected by the skewness of the data. 

4. An application 

The Nile data is used here to illustrate some of the robust methodologies discussed previ- 
ously. The Nile River data set is a well-known and interesting tim e series , whic h has been 
extensively analyzed. This data is discussed in detail in the book by iBeranl (119941k It is firs t 
introduced in Section 1.4 on p. 20, and i s complete l y tab ulated on pp. 237-239. IBeranl ()1994l ) 
took this data from an earlier book by (jToussounl . Il925l . pp. 366-404). The data consists of 
yearly minimal water levels of the Nile river measured at the Roda gauge, near Cairo, f or the 
years 622-1284 AD (663 observations); The units for the data as presented bv IBeranl (|1994l ) 
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FIGURE 6. Left: Histogram of one realization of (Zt)i<t<500- Right: Theoretical autoco- 
variance('.'), robust autocovariance ('o'), classical autocovariance (V) for h = 1, . . . ,40. 



are centimeters (presumably above some fixed reference point). The empirical mean and the 
standard deviation of the data are equal to 1148 and 89.05, respectively. 

The question has been raised as to whether the Nile time serie s contains out l iers; s ee for 
example (jl992h . lRobinsonl dl995h.lChareka et al.1 l|200d ) and lFaiardo et al.1 (|2009h . The 



test procedure developed by 



Chareka et al 



(|2006l ) , suggests the presence of outliers at 646 AD 
(p- value 0.0308) and at 809 (p- value 0.0007). Another possible outlier is at 878 AD. A plot of 
the time series where the observations which have been judged to be outliers are marked, is 
shown in the left part of Figure E and the right part of this figure displays the histogram of 
the data. Although the theory developed in this paper is related to Gaussian processes, we 
believe that the small asymmetry of the data does not compromise the use of this series as 
an illustration of our robust methodology. A way to avoid this asymmetry is to consider the 
logarithm of the data. However, this does not make a significant difference in the estimates. 




Figure 7. Left: The Nile River data plot. Right: Histogram of the Nile River data. 
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The left part of Figure [8] displays plots of the classical and robust sample autocorrelation 
functions of the original data. The autocorrelation values from the former are smaller than 
those of the latter one. However, the difference between the autocorrelations may be not 
large enough to suggest the presence of outliers. Thus, to better understand the influence of 
outliers on the sample autocorrelation functions in practical situations a ne w dataset with 



artific ial outliers was generated. We replaced the presumed outliers detected bv lChareka et al 



(1200(3 ) by the mean plus 5 or 10 standard deviations. The sample autocorrelations (robust 
and classical ones) were again calculated, see the right part of Figure [HJ As expected, the 
values of the robust autocorrelations remained stable. However, the classical autocorrelations 
were significantly affected by the increase of the size of the observation. This is in accordance 
with the results presented in the simulation section. 
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Figure 8. Left: Classical (plain line) and robust (dotted line) sample auto- 
correlation functions of the Nile River data. Right: Classical (plain line) and 
robust (dotted line) sample autocorrelation functions of the Nile River data 
and classical sample autocorrelation f unctions of the Nile R iver data with arti- 
ficial outliers at the times detected bv lChareka et al.1 (j2006l ) (original data plus 
5 standard deviations with " □ " and original data plus 10 standard deviations 
with " V "). 



5. Asymptotic behavior of [/-processes 
Consider the [/-process {U^{r),r G /} satisfying 



U ^ {r) = ^hhD ^ ^-{G{Xi,Xj)<r} ,r E I (41) 

l<i?y<n 

based on the class of kernels 

ka(x,y,r) = l {G{X)y) < r} . (42) 

where / is an interval included in M, G is a symmetric function i.e. G(x,y) = G(y,x) for all 
x,y in M, and the process (ATj)j>i satisfies Assumption (AEJ) with 7(0) = 1. 
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Th e asymptotic properties of these [/-processes have been studied in iLevv-Leduc et al 



(|2009i ) . They are based on the computation of the Hermite rank of the class of functions 
{l{G( v )<r} - U G (r), r e 1} where 

U G (r) = / t {G{x ^ y) < r} (t>{x)(t>{y)&x&y , for all r € I . (43) 

The Hermite rank of the class of functions {l{G( v )<r} — U(r),r £ 1} is obtained by expanding 
the function l{G( v )<r} in the basis of Hermite polynomials with leading coefficient equal to 
1: 

1 {G(x )W )<r} = E ^rr-H p (x)H q (y) , for all x, y in R , (44) 
p,q>o P ' q ' 

where a Piq (r) = E \\^ G ^ X ,Y)<r}H p {X)H q {Y)\ , X and Y being independent standard Gaussian 
random variables. The first few Hermite polynomials are Hq(x) = 1, H\{x) = x, H2(x) = 
x 2 — 1, H%(x) = x 3 — 3x. Note that ao,o( r ) is equal to U G (r) for all r, where U G (r) is defined 
in (|43j) . The previous expansion can also be rewritten as 

l {G{x , y) <r}-U G (r)= Yl ^rH p (x)H q (y) , (45) 

p,9>0 

p+q>m 

where m = m(r) is called the Hermite rank of the function l{G( v )<r} — U G (r) when r is fixed. 

We state the results for family of kernels having Hermite rank equal to m = 2 (this is all 
we need here) and refer to ILevv-Leduc et al .1 ( 2009 ) for other cases. 

Proposition 10. Let I be a compact interval o/R, let kc(-, -,r) be defined in \4ty , and let 

k Gtl (x,r) =E[k G (x,Y,r)\ , x€R,r€l , (46) 

where Y is a standard Gaussian variable. Suppose that the Hermite rank of the class of 
functions {k G (-,-,r) — U G (r) ,r € J} is m = 2 and that Assumption (04TJJ) is satisfied with 
7(0) = 1 and 1/2 < D < 1. Assume that k G satisfies the following three conditions: 

(i) There exists a positive constant C such that for all s, t in I, u, v in R, 

E [\k G (X + u,Y + v,s)- k G (X + u, Y + v, t)\] < C\t - s\ , (47) 

where (X, Y) is a standard Gaussian random vector, 
(ii) There exists a positive constant C such that for all i > 1 and s,t in I, u, v in R, 

E[\k G (X 1 + u,X 1+e + v,t)-k G {X 1 ,X 1+£ ,t)\]<C\u-v\ . (48) 

E[\k G (X 1 ,X 1+e ,s)-k G {X 1 ,X 1+ i,t)\]<C\t-s\ , (49) 
(Hi) There exists a positive constant C such that for all t in I, and x, u, v in R, 

\k Gtl (x + u,t)- kG,i(x + v,t)\< C\u - v\ , (50) 
|fcG,i(x, s) - k Gjl (x, t)\ < C\t - s\ . (51) 
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Then the U -process {^/n{U G (r) — U (r)) , r E 7} defined in (|4ip and ()43|) converges weakly in 
the space of cadlag functions on I, T>(I), equipped with the topology of uniform convergence 
to the zero mean Gaussian process {W G (r),r £ 1} with covariance structure given by 

E[W G (s)W G (t)} = 4 Cov(k G , 1 (X 1 ,8),k G ,i{X 1 ,t)) 

+ 4^ Cow{k G ^{X l ,s),k Gtl {X l+1 ,t)) + Cow(k Gtl {X l ,t),k Gtl (X t+l ,s)) . (52) 



Moreover, for a fixed r in I, as n tends to infinity, 



V^(U G (r)-U G (r)) 



i=i 



r)-U G {r)] +o P (l) 



(53) 



We now consider the case where D < 1/2. In this case, the normalization depends, as 
expected, on D and the slowly varying function L and the limiting distribution is no longer a 
Gaussian process. Let (^i,i)(t))tgR + denote the standard fractional Brownian motion (fBm) 
and (Z2 t D(t))t€R + the Rosenblatt process. They are defined through multiple Wiener-Ito 
integrals and given by 



Zx, D {t) 



and 



Z2,D(t) 



(u — x)+ D+l ^ 2 (u — y)j} D+l ^ 2 du 



dB(x) 



< D < 1 



(54) 



dB(x)dB(y), < D < 1/2 , (55) 



where B is the standard Brownian motion, see lFox and Tag qui ( 19871 ). The symbol J means 
that the domain of integration excludes the diagonal. Introduce also the Beta function 



poo 

Jo 



T(a + 0) 



a > 0, > . 



(56) 



Proposition 11. Let I be a compact interval o/K Suppose that the Hermite rank of the class 
of functions {k G (-, -,r) — U G (r) ,r £ /} is m = 2 and that Assumption is satisfied with 
7(0) = 1 and D < 1/2. Assume the following: 

(i) There exists a positive constant C such that, for all i > 1 and for all s,t in I, 

E[\k G (X 1 ,X 1+i ,8)-k G (X 1 ,X 1+ e,t)\]<C\t-8\ . (57) 

(ii) U G is a Lip schitz function 

(Hi) The function A defined, for all s in L, by 

A(s) = E[k G (X, Y, s)(\X\ + \XY\ + \X 2 - 1|)] , (58) 

where X and Y are independent standard Gaussian random variables, is also a Lips- 
chitz function. 

Then, the U -process {U G (r) — U G (r),r £ 1} defined in fl^ i[ ) and |^3| ) has the following 
asymptotic properties: 

{n D L-\n){U G (r)-U G (r))-rtl} 
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converges weakly in the space of cadlag functions T>(I), equipped with the topology of uniform 
convergence, to 

{f3{DY l [ai,i(r)Zi jD (l) 2 + a 2i0 (r)Z 2>D (l)] ;r G 1} , 

where the fractional Brownian motion and the Rosenblatt process Z 2 ^£>(-) are defined in 

( |5^[ ) and < I55|) respectively and where (3(D) = B((l — D)/2, D), B denoting the Beta function, 
defined in §5t 



Propositions [10] and [TT] will be applied to the t7-process U n {r) in (|36p with 

U(r)= f l {llx _ yl < r} dF(x)dF(y)=T 1 (F)[r], (59) 



with Ti given in ()13p . By Lemma the Hermite rank of the class of functions {t\ x - y \< r ~ 
U(r),x,y G R, r G 7} is equal to 2 where 7 = [ro — n,ro + rj] for some positive rj defined in 
Lemma [TH and where ro = l/c(<5) (see (|10p ) is such that 

Ti ($) [r ] = Tj (*) [l/c($)] = 1/4 . (60) 
6. Proofs 

Proof of LemmaUl Denote by F the c.d.f. $^,0- of Xi. Since a n (F n — F) converges in distri- 
bution in the space of cadlag functions equipped with the topology of uniform convergence, 
the asymptotic exp ansion (1181) can be de duced from the functional Delta method stated e.g. 
in Theorem 20.8 of van der Vaart ( 19981 ). To show this, we have to prove that Tq = T\ o T 2 



is Hadamard differentiable, where T\ and T 2 are defined in (|13|) and (|14p respectively and 
that the corresponding Hadamard differential is defined and continuous on the whole space of 



cadla g functions. For a definition of Hadamard differentiability, we refer to (jvan der Vaartl . 



19981 . Chapter 20). 

We prove first that the Hadamard differentiability of the functional T\ defined in (|13p . Let 
{gtj be a sequence of cadlag functions with bounded variations such that \\gt — p|| — > 0, as 
t — > 0, where g is a cadlag function. For any non negative r, we consider 

t- 1 {T 1 (F + tg t )[r]-T 1 (F)[r]} 

= 2 1 I l{\ x -y\< r }dF(x)dg t (y) +t / t{\ x -y\< r }dgt(x)dg t (y) . 

JR JR JR JR 

Since 

/ / 1 {\x-y\<r}dF{x)dg t {y) - ^{\x- v \<r}dF{x)dg{y) 
JrJm. JrJr 

(gt(x + r) - g(x + r))dF(x) - / (g t (x - r) - g(x - r)) dF(x) < 2 \\g t - gW^ , 

JR 

as t tends to zero, the Hadamard differential of T\ at g is given by: 

{DT x {F).g){r) = 2 / / l^,^} dF(x)dg{y) = 2 / {g(x + r) - g(x - r)}dF(x) . 

JR JR JR 
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By Lemma 21.3 in Ivan der Vaartl (119981) . To is H adamard differentiable. Finally, using the 
Chain rule (Theorem 20.9 in Ivan der Vaartl (|1998l )). we obtain the Hadamard differentiability 
of To with the following Hadamard differential: 

(DT\(F) .g)(T (F)) 2 Ug(x + T (F)) - g(x - T (F))}dF(x) 



DT (F).g 



(Ti(F))'[Tq(F)] 



(Ti(F))'[T (F)] 



(61) 



In view of the last expression, DTq(F) is a continuous fu nction of g and is de fined on the 
whole space of cadlag functions. Thus, by Theorem 20.8 of van der Vaart ( 19981 ) . we obtain: 



a n (Qn (X 1:n , $) - Q(F)) = c($) DT (F).{a n (F n - F)} + o P (l) , 



(62) 



where c($) is the constant defined in ([Ifi]). Bv (fT3~|). Ti(F)[r] = f R [F(x + r) - F(x - r)]dF(x) 
and since F(-) = $ M)£r (-) = $((• - aO/V), we S et 

(T 1 (F))'[r] = - / *(y+-)*(y)dy. 

Since a = Q($ M , CT ) = c(*)T (F) by ©, we get 

(T 1 (F))'[T (F)] = 2a" 1 J <p(y)<p(y + l/c($))dy . 

Applying (fBTj) with To(F) = <7/c(3>), using ([63]) . and setting g = a n (F n — F), we get 

DT (F).{a n (F n - F)} = A n - B n , 

where 



A n = a n ((T 1 (F))'[T (F)]) 



-i 



cm 



F x 



a 



c(d>) 



cLF(x) 



and S n has the same expression with F replaced by F n . The integral in A n equals 

/ / M\y- x \<a/ cm dF(x)dF(y) = Ti(F)[r CF)] = 1/4 , 
by definition (see ©). The corresponding integral in B n equals 



(63) 
(64) 
(65) 

(66) 



n ^ i $ 



0" 



cm 



Xi-fi 



cm 



The result follows from (|62p . (|6ip . (|64p . (|63|) and the above expressions for A n and B n . □ 

Proof of Theorem^ Assumption (Ad]) and the Theorem of Csorgo and Mielniczuk ( 19961 ) 
implies that y/n{F Tl — 3>o,ct) converges in distribution to a Gaussian process in the space of 
cadlag functions equipped with the topology of uniform convergence. Thus, the asymptotic 
expansion of a n (Q n {X\- n , $) — a) obtained in (fT5|) is valid with a n = yfn. We thus have to 
prove a CLT for n- 1 / 2 £™ =1 W(Xi/a, Q, Using Lemma[12]below, we note that the Hermite 
rank of IF(-, Q, <£) is equal to 2 and the conclusion follows by applying (jBreuer and Maiorl . 
19831 . Theorem 1). □ 
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Proof of Proposition^ Note that o\ x = 7(0)<7 2 Y , where (i^)i>i satisfies (A[T|) with 7(0) = 1. 
Observe that 5^ y - 1 is a [/-statistic with kernel k(x,y) = (x — y) 2 /2 — 1. The Hoeffding 
decomposition of this kernel is given by k(x, y) = (x 2 — l)/2 + (y 2 — l)/2 — xy. From this, we 
obtain the corresponding Hoeffding decomposition of cr 2 Y — 1 as 



1 " 



n(n 



1) ^ 



(67) 



Under Assumption (AQ]), the first term of this decomposition is the leading one. Then, using 
( Breuer and Maiorl . Il983l . Theorem 1), we get that \fn(d 2 lX — a 2 ) converges to a zero- mean 
Gaussian random variable having a variance equal to 2(7(0) 2 + 2 Y^k>l li k ) 2 )- Usm S the Delta 
method to go from a 2 x to a n ,x, setting f(x) = y/x, so that /'(cr 2 ) = l/{2\/~a 2 ~) = 1/(2<t), we 
get that the asymptotic variance of y/n{d n ,x — cr) is thus equal to (f28j) . 



By Lemma [T2| the Hermite rank of IF(., Q, <£) is equal to 2, hence using (jArconesl . 11994 
Lemma 1), a 2 defined in (1261) satisfies 



a 2 < 7 (0)- 1 E[IF(X 1 / ( 7, Q, $) 2 ]{ 7 (0) 2 + 2 ]T ^{k) 2 } 



k>l 



Finally, in this case, using that E[IF(Xi/o-,Q,$) 2 ] « 0.6077 (jRousseeuw and Crouxl . 119931 
p. 1278), the relative asymptotic efficiency cr 2 t /cr 2 of Q n (Xi :n , 3>) compared to <7 n ,x is larger 
than 82.27% since 



(27(0))- 1 (7(0) 2 + 2E fe >i7(fe) 5 



7 (0)-iE[IF(Z 1 /a, Q, $) 2 ]{ 7 (0) 2 + 2 £ fc>1 l(k) 2 } 



0.5/0.6077 « 82.27% . 



□ 



Proof of Theorem^ Let 3> CT ,+ and 3>o-,- denote the c.d.f of (Xi+Xj+zJi^i and (X— Xj + /j)j>i, 
respectively. Let also denote by F + ^ n _h and F_^ n _h the empirical c.d.f of (Xj + Xj + / l )i<j< n _/ l 
and (Xj — Xj + ^)i<j< n _^, respectively. Since (Xj)j>i satisfy Assumption (A[T]), it is the same 
for (Xj + Xj_|_/j)j>i and (Xj, — Xj_i_h),;>i with scales equal to Q(^ ff .+) and Q($> a _), respectively. 



Thus, using the Theorem of lCsorgo and Mielniczukl (|1996l ). we obtain that \/n — h(F +tn -h — 
$ CT ,+) converges in distribution to a Gaussian process in the space of cadlag functions equipped 
with the topology of uniform convergence and that the same holds for \/n — h(F_ n _/j — 
<&&-)■ As a consequence, the expansion (fl8]) is valid for Q n -h {Xi- n -h + Xh+i :n , and 
Q n _ h (Xi :n _ h - X ft+ i :n , $) with a n _ ft = Vn- h, that is 



Vn-h [Q n _ h (X 1:n _ h ± X h+1:n , $) - Q($ ff ,±) 



n 



^iF(x i ±x +h ,g,$ CTi± )+ 0P (i) 
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Then, applying the Delta method (jvan der Vaartl . ll998l . Theorem 3.1) with the transformation 
b(x) = x 2 , b'(x) = 2x, we get 



[Q n _ h (X 1:n _ h ± X h+1:n , $) 2 - Q 2 ($ CT , ± )] 



2Q($ 



n—h 



<T,±J 



\fn — h 



lF(Xi ± Q, $ CT ,±) + 0P (1) . 



1=1 



Hence 7q(/i, Xi :n , $) in (|29|) satisfies the following asymptotic expansion: 



n—h 



V^h(j Q (h,X 1:n ,^) - {Q 2 (^ + ) - Q 2 (^_)} /A) =-^==J2^(X i ,X i+h )+o P (l) , 

(68) 



i=l 



where for all x and y. 



IF (X + y, Q, <D CT ,+) - Q($ CT ,-) IF (x - y, Q, $ CT ,_)} 



Using the identity (|19p . ^ has the expression given in (|31|) . We have now to prove a CLT for 
(n — h) -1 / 2 y~!"~ | fe ip{Xj, Xj+h). Using Lemma \13\ the definition of the Hermite rank given in 
(lArconesl. 1 19941. p . 2245) and Assumption (AH]), we obtain that Condition (2.40) of Theorem 
4 (jArconea . [1994J, p. 2256) is satisfied with r = 2. This concludes the proof of the theorem 
by observing that {Q 2 ($ CT ,+) - Q 2 ($ CT ,_)}/4 = E[XiX 1+h ] = j(h) (see [ft. □ 

Proof of Theorem® Since, by scale invariance, Q n (Xi :n , $) — a = <r(Q n (X± :n /a, $) — 1), we 
shall focus in the sequel on the case 7(0) = 1. First, note that using Lemma [T41 below the 
Hermite rank of the class of functions {lri._.i< r } — U(r) , r E [to — rj, ro + rj\} is m = 2, where 
U is defined in (f59|) and ro in ([60]) . 

(i) Suppose first D > 1/2. Let us verify that the assumptions of Proposition 1101 hold. 
Conditions (|47|) and ([48 p are easily verified. Let us check Condition ([49 p . Note that for all 
t > 1, X\ — X\ + £ ~ JV(0, 2(1 — 7(^))), thus if t < s, there exists a positive constant C such 
that, 



E[fc(Xi J X 1+<) a)-fc(Jfi,X 1+/) t)]=P(t < < s) < 



V4tt(1- 7 W) 



I* — s] < C|i— s| , 



where k(x,y,r) = tn x _ y \ <r \. Since 7^) — ► as t — > 00, we obtain 
Conditions (1501) and (ISTI) are satisfied since 



fci(s, r) = E[l { |,_y|< r} ] = $(x + r) - $(x - r) . (69) 
Now consider the process 

{\/n(?i(F n )[r] -Tx(F)[r]),r G [r - 77, r + 7/]} , (70) 

where F = $ and 

Ti(F)[r] = / l { |j / _ a .i< r} d$(a;)d$(y) = / [$(x + r) - $(x - r)]d$(x) . 
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By Proposition [TUl the process (|70p converges weakly to a Gaussian process in the space of 
cadlag functions equipped with the topology of uniform convergence for some r] > when 
D > 1/2. 

(ii) Suppose now D < 1/2. Let us check that the assumptions of Proposition 1111 hold. 
Condition (|57p holds since it is the same as Condition (|49p . Since k\ is a Lipschitz function, 
so is U defined in ([Blip . Let us now check Condition ([55]) . If s < t 

1 {s<x- y <t}{\x\ + \xy\ + \x 2 - l\)<f>(x)<f>(y)dxdy = / I / <fi(y)dy ) \x\4>(x)dx 

\Jx-t J 

+ / ( / \y\4>{y)dy ) \x\<j)(x)dx + I ( / (p(y)dy ) |x 2 - l|0(z)da; . 

./R Vii-t / JR \Jx-t J 

Since </>(•) and |.|</>(-) are bounded and that the moments of Gaussian random variables are 
all finite, we get (|58p . Then, applying Proposition [11] and Lemma leads to the weak 
convergence of the process {(5(D)n D / L(n)(Ti(F n )[r] — Ti(F)[r]),r G [tq — 77,7*0 + r/]} to 
{0(r/^2)(Z 2 , D (l) - Z ljD (l) 2 );r E [r - i h r + ??]}. 

We now want to use the functional Delta method as in the proof of Lemma Q] in both cases 
(i) an d (ii). 



By (jvan der Vaartl . Il998l . Lemma 21.3), T 2 defined in (|14p is Hadamard differentiable with 
the following Hadamard differential: DT 2 {T 1 (<Z>)) -g = -g(r )/(Ti(^)y[r ] . Thus DT 2 (Ti($)) 
is a continuous function with respect to g. By the functional Delta method, with To = T 2 oTi, 
we obtain the following expansion: 

a„.(Qn(X 1:n ,d.)-Q($)) = c(c&) a n (T (F n )-T (<S>)) = -c(<&) a n (Tl( ^ ~ ^ ($))N +o P (l) , 

(Ti($)) '[r J 

(71) 

where a n = -y/n in the case (i) and a n = f3(D)n D / L(n) in the case (ii). In case (i), 

( )v ^ — m($))'N ^(°.^). 

where a 2 is given by Equation (|52p in Proposition 1101 



o\ = 4Var 



c($) 



m(*)y[r ] 



iJ^Cov 

fc>l 



c($) c(<E>) 

„/c! (X x , r ) , - — — — ^fc! (X fc+1 , r ) 



(T!(*))'[ro] ±v u " (T!(^))'[r ] 
where fci is defined in (|69p . Since 

E[fc 1 (X 1} ro)]=E[*(X 1 +r )-*(Xi-ro)]= / l { u_d< r} d<I>(x)d<I>(y) = 1/4 



, (72) 
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by dSQD and r = l/c($) by ©, we get using (JH3J) that 

-2c($)[fci(Xi,ro)-E(fci(Xi,r ))] 
m(*))'[r ] 

= **) ^ + ^ ^ » - + V C W) = IF^, Q, ») , (73) 

1 ; 2/ K 0(y)0(y + l/ C ($))dy V V ; 

where IF(-,Q,$) is defined in {SO]). Using ([72"]) . ([75]) and ([55 ]) in Lemma [TJ we get that 

of = E[JF(X 1 ,Q, $)] + 2 ^E[IF(X l5 Q, $)IF(X jfc+1 , Q, $)] , 

fc>i 

which concludes the proof of (i). In the case (ii), in view of (|7ip . it is sufficient to show that 

^a/ffl nD (^lC-^n) -Ti($))[r ] rf 1 2 

This result follows from the convergence in distribution of (3{D)n D / X(n)(Ti (F n ) [r ]-Ti ($) [r ]) 
to <i>(r /V2)(Z 2>D (l)-Z 1)D (l) 2 ), §3j) and the identity -c($) ^»(r /v / 2)(2 J R (/>(y)(/>(y+ro)dy) _1 = 
1/2. This identity follows from ^(ro/v 7 ^) = -(2 v / 7r)~ 1 exp(-rQ/4)r and r = l/c($). □ 

Proof of Proposition [?| Using the same arguments as those used in the proof of Proposition 
[3j we get that o 2 nY — \ satisfies the Hoeffding decomposition ([67]) . where iXi)i>\ satisfies (AE]) 
with 7(0) = 1. 



(a) If D > 1/2, using iDehling and Taqqul (|199ll ). the first term in the decomposition ([6] 
is the leading one, then using the same arguments as those used in the proof of Proposition 
[5J we get that the asymptotic variance of yfn {5 n ,x — o) is equal to 

(2 7 (0))- 1 (7(0) 2 + 2^ 7 (A ; ) 2 ). 

k>l 

Using the same upper bound as the one used in the proof of Proposition [5J we get that the 
relative efficiency of the robust scale estimator i s, in this case, large r than 82.27%. 

(b) If D < 1/2, we can apply the results of Dehling and Taqqu ( 199ll ) and the classical 
Delta method to show that 

P(D)n D L{n)-\a n , x - a) a/2 (Z 2 , D (1) - . 

□ 

Proof of Theorem^ Let an d denote the c.d.f of {Xi+X i+ h)i>i and (Xi — X i+ ^)i>i, 
respectively. Since (X,),>i satisfies Assumption (A[2]), a straightforward application of a 
Taylor formula shows that the same holds for (Xj + Xi+h)i>i with a scale equal to Q(<J> CT) +) 
and L replaced by some slowly varying function L. Thus, in the case (i), where D > 1/2, we 
obtain that Q n -h ({^l.-n-ft + ^h+i-.n} I <?,+)■> satisfies the expansion ([7T]) with a n = s/n 
as proved in the proof of Theorem [6j Using (|53p , we get that 

Vn~{Q n ~ h (X 1:n _ h + X h+1:n , $) - Q($a,+)} 

= ~ m^lyN ^ M{Xl + X »h}/Q(**,+),ro) - U(r )] + o P (l) , 
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where k± and U are denned in ([69)) and (|59|) . respectively. Thus, using (fT3[) and (fl~9|) . we 
obtain 

1 ™ 

V^{Qn-/ l (^l:n^ + ^+l:n,$)-Q(^,+)} = -p= Y\ W{Xi + X i+h , Q, + Op(l) . (75) 

In the case (ii), where D < 1/2, we get from the expansion ([71 p that 

/?(D)i- MQ n _ h ({X l!n _ h + X h+l!n }/Q(* tr , + ), $) - 1) 

L(n — h) 

= -# P k 77 T^Tvr i +op(1). 76 

L(n-h) { T i{®))[ro\ 

where F + ^ n _h denotes the empirical c.d.f of ({A^ + ^Q+/ t }/Q( < I > cr,+))i<i<n-/i- 

Let us now focus on the autocovariances and consider first the case (i) where D > 1/2. Let 
us denote by j-(k) the autocovariance of the process (Xi — Xi + h)i>i computed at lag k. Using 
a Taylor formula, 7— (&) = 0(k~ 2 ~ D+€ ), for e in (0, D) such that Li(x)/x e = 0(1), as x tends 
to infinity, for all i = 0, 1,2,3. Let F_^ n _h denote the empirical c.d.f of (Xi — Xi + h)\<i< n -h- 
Since |7_(Ar)| < oo, the process (Xi — Xi + h)i>l satisfies Assumption (A[T]) implying that 
y/n(F_ >n _f l — converges in distribution to a Gaussian process in the space of cadl ag 

functions equipped with the topology of uniform convergence ([Csorgo and Mielniczuld (jl996l )). 
As a consequence, by Lemma [Q the expansion (fl~8j) is valid for Q n -h (^l:n-h ~ ^h+l:m^) 
with a n = \Jn where IF is defined in (|2Up . 

Then, in the case (i), using the Delta method (Theorem 3.1 P. 26 in Ivan der Vaartl ([199a )). 
7q(/i, Xi :n , <&) satisfies the following asymptotic expansion as in f|68j) : 

1 n—h 

V^h (%(h, X 1:n , $) - {Q 2 ($ a ,+) - Q 2 ($ CT ,-)} /4) = - ? == ^ V(X, A 4+h )+op(l) , 

2 = 1 

(77) 

where ^ is defined in ([31|) . Hence, we have to establish a CLT for (n—h)' 1 / 2 y~!^L~ 1 fe i b (Xj, X ^). 
Using Lemma [T3l the definition of the Hermite rank given on p. 2245 in lArconesI (|l994l ) and 
Assumption (A Bt with D > 1/2, we obtain that Condition (2.40) of Theorem 4 (P. 2256) in 
Arconesl (|l994h is satisfied with r = 2. This concludes the proof of (i) by observing from ((7J) 



that {Q 2 ($ CT , + ) - Q 2 ($ CT ,_)}/4 = E[JTiX 1+h ] = 

Consider the case (ii) where D < 1/2. Using ® and -y(h) = [Q 2 (<$>^ + ) - Q 2 ($ CT _)]/4, 
one has 

%(h,X lin ,$)- 1 (h) = A+-A-, (78) 

where 

At = (Xv.n-h ± Xft+lsn, $)' " Q 2 (^,±)] • 

We first show that the contribution of A~ is negligeable. Since the expansion (j!8p holds 
for y/n — h(Q n _h (Xi :n -h — Xh+i-.n,®) — Q(^a,-)), we conclude by arguing as in the proof 
of Theorem [21 that this expression is Op(l). Applying the Delta method, we get the same 
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type of result for Q 2 n _ h , namely y/n - h(Q n _ h (X 1:n _ h - X h+1 . n ,$) 2 - Q 2 ($ (T _)) = Op(l) 
and therefore, since D < 1/2, 



(r, - h) ' 1 / 2 

/ W ~ ] - V^hA- = o P (l) . (79) 
L(n — h) 

We now turn to A£. Applying the Delta method with the transformation b(x) = x 2 to (|76p 
and using ([71]) yields 

(n-h) D + c(*)/3(D) (n-h) D 2 (T^F+^-T^mro] , n m 

W !M " = 2— (< ^ +) mMTN + op(1) 

The result follows from (USD, <|7jg) and Q 2 ($ CT ,+) = Var(Xi + = 2(7(0) + 7(/i)). 

□ 

Proof of Proposition 0. The classical autocovariance estimator can be obtained from the clas- 
sical scale estimator ct n) x as in Equation (|12[) . More precisely, a straightforward calculation 
leads to 

l(h) = i (^_h,X 1:n _ h +* h+1:n " £n-h,X 1:n _ h -* h+1:B ) (1 + o(l)) + Op (4^ J • (80) 

In order to alleviate the notations, a n -h ) x 1 . n _ h +x h+1 . n will now be denoted by a + and ?n-/i,x 1 .„_ fc -x ft+1 .„ 
by a-. 

On the one hand, using Proposition [7] and the same arguments as in the beginning of the 
proof of Theorem [8j we have 

P(D)£- (a+ - *+) -±> ^(Z 2 , D (1) - Z hD (l) 2 ) , 
L[n) L 

where cr+ denotes the standard deviation of X\ + X\+h and L(n) is defined in Theorem [HJ 
Note that a\ = 2(7(0) + j(h)). By the classical Delta method, we thus obtain 

P(D)£- (a 2 + - a\) 2(7(0) + j(h))(Z 2 , D ^) - ^i,d(1) 2 )- (81) 
L(n) 

On the other hand, by the same arguments as in Theorem [8j the process (Xj — JQ_|_k)i>i 
satisfies Assumption (A[TJ). Let cr 2 = 2(7(0) — 7(^1)) denote the variance of X\ — X\ + h- Then 
as in the proof of Proposition [21 \/n (a 2 — a 2 _ ) converges in distribution. This implies that 

P(D)^-(a 2 _-a 2 _)=o P {l). (82) 
L{n) 



Using ([HOD, (JSU and flSJ), we get: 
n D 

(WO " , 
L(n) ^ 



□ 
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7. Technical lemmas 

Lemma 12. Let X be a standard Gaussian random variable. The influence function defined 
in (|20p has the following properties: 

E[IF(X,Q,$)] =0, (83) 

E[XW(X,Q,$)] =0 , (84) 
ELY 2 1F(X, Q, $)] = (20F/3)" 1 exp(-l/(4c 2 )) ^ , (85) 

where $ zs i/te c.d./ o/ a standard Gaussian random variable, c = c($) is defined in fli7| ) and 
P = f<l>(y)<Kv + l/c)dy. 

Proof of Lemma\12l Let us first prove that E[LF(X, Q, $)] = 0. It is enough to prove that 
E[$pT + 1/c) - $(X - 1/c)] = 1/4. Using the definition of c, namely (JSSJ) or (J60]), we get: 

E[$(X + 1/c) - $(X - 1/c)] = / ($(x + 1/c) - $(x - l/c))4>{x)&x 

Jr 

H\y-x\<i/c}^x)^{y)dxdy = Ti(*)[l/c] = 1/4 . (86) 



Then, let us prove that E[XIF(X, Q, <&)] = 0. Since X has a standard Gaussian distribution, 
it suffices to prove that E[X{<i>(X + 1/c) — &(X — 1/c)}] = 0. By symmetry of <fi, we obtain: 



E[X$(X + l/c)]= / x$(x + l/c)(f)(x)dx = / x(l-$(-x-l/c))0(x)dx 

= - / x$(-x - l/c)(p(x)dx = E[X$(X - 1/c)] . 
Jr 

Finally, let us compute: E[X 2 IF(X, Q, <&)]. Set /? = f <p(y)<p(y + l/c)dy. By integrating by 
parts, using (|86|) and finally the symmetry of (j>, we get 



(/3/c)E[X 2 IF(X,Q,cD)] = - f I r +lA xV(x)dx ) <f>(y)dy + 1/4 

il \ Jy-l/c I 



■y+l/c \ 

{(y-l/c)0(y-l/c)-(i/ + l/c)0( ! / + l/c)}0( ! /)dy-/ I / 0(x)dx U(y)dy+l/4 



y-l/c 

{-(y - l/c)0(y - 1/c) + (y + l/c)<^(y + 1/c)} </>(y)dy 



where the last equality comes from j R ( fy-i/c <A( x )dx^ 0(y)dy = Ti($)(l/c) = 1/4. By sym- 
metry of (f), 



{-(y - l/c)</>(y - 1/c) + (y + l/c)0(y + 1/c)} <j>{y)dy = -2 / {y-l/c)<l>{y-l/c)</>{y)dy 

Jr 

= (2c v ^)- 1 exp(-l/(4c 2 )) , 
which concludes the proof. □ 
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Lemma 13. Let (X, Y) be a standard Gaussian random vector such that Cov(X, F) = and 
let and <3>_ denote the c.d.f. ofX + Y and X — Y , respectively. The influence function tp 
defined, for all x and y in R ; by 

y) = \ {Q($+) IF (x + y, Q, $+) - Q(<&_) IF (x - y, Q, d>„)} , 
satisfies the following properties: 

E[t/,(X,F)] = 0, (87) 
E[Xip(X, Y)] = E[Yip(X, Y)] = 0, (88) 
E[XYip(X, Y)] 7^ . (89) 

Proof of LemmaUM Using CE9|), ([BSD and Q($±) 2 = Var(X ± F) (see ©), we get that 
Ety(X, Y)] = \{Q{<5>+? - Q(<$>-f}E [W(U, Q, $)] = , 

where U is a standard Gaussian random variable, which gives (|87p . Let us now prove (|88p . 
First note that, 

E[X</,(X, F)] = i {E[(X + F)y,(X, F)] + E[(X - F)V(X, F)]} . 

But, 

E[(X + F)V>(X,F)] = ^E[Q($ + ) 2 (X + F)IF((X + F)/Q($+),Q,$) 

- Q($_) 2 (X + F)IF((X - F)/Q($_), Q, $)] 

= i [g($ + ) 3 E[C/IF([/, Q, $) - Q(cD_) 2 Q(cD + )E[C/IF(y, Q, $)] , 

where £7 = (X + F)/Q($+) and V = (X — F)/Q($_) are independent standard Gaussian 
random variables. By flgj, E[(X + F)^(X, F)] = 0. In the same way, E[(X -F)V>(X, F)] = 
which gives {(HED ■ Let us now prove dHHD- Using that 4XF = (X + F) 2 - (X - F) 2 , we get 

8E[XY^(X, F)] = E[(X + F) 2 Q($+)IF(X + F, Q, $+) + (X - F) 2 Q($_)IF(X - F, Q, $_)] 
- E[(X - F) 2 Q($+)IF(X + F, Q, $+) + (X + F) 2 Q($_)IF(X - F, Q, $_)] 
= (Q($ + ) 4 +Q($_) 4 )E[C/ 2 IF(C/,Q,$)]-Q($+) 2 Q($_) 2 (E[F 2 IF(C7, Q, $)] + E[U 2 IF(V, Q, $)]) 

where £/ and V are as above. The first term is non-zero by (|85p while the second term is zero 
by independence of U and V and (f83|) . This yields (f89"p . 

□ 

Lemma 14. Xei a P) g(r) = E[\^ X -Y\<r}Hp{X)Hp(Y)] where X and Y are independent stan- 
dard Gaussian random variables and H p is the pth Hermite polynomial with leading coefficient 
equal to 1. Then, 

(i) ai, (r) = 0, Vr G R 

(ii) a 2i0 (r) = -a 1( i(r) = 0(r/V2), Vr G R 

fiiij Moreover, there exists some positive rj such as a2,o(r) = — ai,i(r) is different from 
when r is in \tq — n; tq + 77], where ro is defined in M0\) . 
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Proof of Lemma \14\ The proof of (i) follows from the symmetry of the Gaussian distribution 
and the proof of (ii) relies on the following identity: for all r£R, 

/ ((f>(x + r) — 4>{x — r))x(j>(x)dx = <p(r/V2). 
Jr 

Let us now turn to the proof of (iii). ^>(r/v2) is equal to zero only if r = 0. By (|10p . r$ is 
such that < I ) (ro/v / 2) = 5/8, and hence is different from 0. The existence of n follows from the 
continuity of (f>. □ 



8. Conclusion 

In this paper, we studied the asymptotic properties of the robust scal e estimator CL 
(Ro usseeuw and Crouxl (|1993l )) and of the robust autocovariance estimator 7q (|Ma and Genton 
(|200(]h K j'or short and long-range dependent processes. We showed that the asymptotic vari- 



ance of these estimators is optimal, or close to it, and we verified, by using simulations, 
that these estimators are indeed robust in the presence of outliers. Complete proofs of the 
asymptotic properties of the robust scale Q n and the covariance estimator 7q are provided 
for Gaussian stationary processes. The central limit theorems for Q n and 7q were obtained. 
In all cases, the rate of convergence of the estimators is \fn, except for long-range depen- 
dent processes with D £ (0,1/2), for which the rate is n D L(n)~ 1 . Empirical Monte-Carlo 
experiments were conducted in order to illustrate the finite sample size properties of the es- 
timators. The robustness of Q n and jq were also investigated when the process contained 
outliers. The theoretical results and the empirical evidence strongly suggest the use of these 
estimators as an alternative to estimate the scale and the autocovariance structure of the 
process. The classical scale and autocovariance estimators were also considered as means of 
comparison. All estimators showed similar empirical accuracy when the data did not contain 
outliers. However, the classical estimators were significantly affected when additive outliers 
are present. The robust ones, however, were much less affected. 
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